Quantitative Comparisons between Time Domain Speech Fundamental Frequency Estimation Algorithms
نویسندگان
چکیده
. T W O techn iques a r e presented here t o enable q u a n t i t a t i v e comparison o f t i m e domain fundamental f requency e s t i m a t i o n a l g o r i t h m s a g a i n s t a r e fe rence , t h a t makes use o f t h e o u t p u t f rom a laryngograph. These measures a re c a r r i e d o u t on t h e p u l s a t i l e ou tpu t s produced by t h e dev ices, where each pu lse corresponds t o an epoch o f a c o u s t i c e x c i t a t i o n due t o a v o c a l f o l d c l o s u r e . The r e s u l t s g i ven he re a r e f o r a peak-p ick ing a lgor i thm. The comparison techn iques are: 1 ) Receiver ope ra t i ng c h a r a c t e r i s t i c . Th i s i s a p l o t o f t h e p r o b a b i l i t y o f success fu l d e t e c t i o n o f a voca l f o l d c l osu re , as compared t o t h e re fe rence , a g a i n s t t h e number o f f a l s e a la rms. It i s shown t h a t t h i s measure g i v e s a c l e a r i n d i c a t i o n as t o how w e l l t h e dev ice under t e s t per forms w i t h r espec t t o t h e re fe rence , as w e l l as p r o v i d i n g a q u a n t i t a t i v e method f o r dev ice parameter o p t i m i s a t i o n . 2 ) J i t t e r d i s t r i b u t i o n . Th i s i s a h is togram o f t h e d i f f e r e n c e s i n t h e t imes o f occurence o f o u t p u t pu l ses f rom t h e re fe rence and t h e corresponding t ime -a l i gned pu lses f r om the dev ice under t e s t . Th i s measure g i ves an i n d i c a t i o n o f hovd p r e c i s e l y and c o n s i s t e n t l y dev ices a r e ab le t o l o c a t e v o c a l f o l d c l o s u r e i n s t a n t s .
منابع مشابه
Real-time fundamental frequency estimation by least-square fitting
The real-time performance of a fundamental frequency estimation algorithm depends not only on its computational eeciency but also on its ability to obtain accurate estimates from short signal segments. Previous frequency-domain algorithms make use of spectral analysis algorithms that require the application of a window function, which cause them to fail when signal segments are short and their ...
متن کاملRobust algorithms for speech reconstruction on mobile devices
This thesis is concerned with reconstructing an intelligible time-domain speech signal from speech recognition features, such as Mel-frequency cepstral coefficients (MFCCs), in a distributed speech recognition(DSR) environment. The initial reconstruction methods in this thesis require, in addition to MFCC vectors, fundamental frequency and voicing information. In the later parts of the thesis t...
متن کاملA Novel Frequency Domain Linearly Constrained Minimum Variance Filter for Speech Enhancement
A reliable speech enhancement method is important for speech applications as a pre-processing step to improve their overall performance. In this paper, we propose a novel frequency domain method for single channel speech enhancement. Conventional frequency domain methods usually neglect the correlation between neighboring time-frequency components of the signals. In the proposed method, we take...
متن کاملPitch estimation in noisy speech based on temporal accumulation of spectrum peaks
In this paper, we present a study on robust pitch estimation by integrating spectral and temporal information in speech. Spectrum harmonics are important representations of the speech fundamental frequency. Harmonic-related spectral peaks of speech evolve much more slowly than the spectral peaks of noise. This motivates the proposition of temporally accumulated peak spectrum (TAPS), which is co...
متن کاملA Spectro-Temporal Demodulation Technique for Pitch Estimation
We consider a two-dimensional demodulation framework for spectro-temporal analysis of the speech signal. We construct narrowband (NB) speech spectrograms, and demodulate them using the Riesz transform, which is a two-dimensional extension of the Hilbert transform. The demodulation results in timefrequency envelope (amplitude modulation or AM) and timefrequency carrier (frequency modulation or F...
متن کاملSmooth Cepstrum Calculation Using Modified Bartlett Hanning Window
Cepstrum is an algorithm for analyzing the speech signals in frequency domain. This is conventional method of fundamental peak picking i.e. fundamental frequency or pitch. For a speech signal it is necessary to identify the fundamental frequency correctly in order to have robust system for speaker identification and verification. Using this approach two algorithms has been proposed using Hammin...
متن کامل